LOCUS pDONR™/Zeo DISCLAIMER Certain terms are trademarks or registered trademarks of Invitrogen Corporation. See "Intellectual Property" in the Help file for more information. FEATURES Location/Qualifiers misc_feature complement(268..295) /note="rrnB T2 transcription termination sequence (c)" misc_feature complement(427..470) /note="rrnB T1 transcription termination sequence (c)" primer_bind 537..552 /note="M13 Forward (-20) priming site" misc_recomb 570..668 /label=attL1 /note="attL1" misc_recomb complement(1709..1805) /label=attL2 /note="attL2 (c)" misc_signal complement(1820..1839) /note="T7 Promoter/priming site (c)" primer_bind 1847..1863 /note="M13 Reverse priming site" gene 1976..2785 /note="Kanamycin resistance gene" rep_origin 2906..3579 /note="pUC origin" vector join(1717..3582,1..651) /source="pDONR%99221" /type="Donor Vector" misc_feature 669..691 /note="TEV site" source 692..1702 /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:97246 IMAGE:7262495" /tissue_type="PCR rescued clones" /clone_lib="NIH_MGC_244" /note="Vector: pPCR-Script Amp SK(+)" gene 692..1702 /gene="HS3ST3B1" /gene_synonym=30ST3B1 /gene_synonym=3OST3B1 /db_xref="GeneID:9953" /db_xref="MIM:604058" CDS 692..1702 /dnas_title="heparan sulfate D-glucosaminyl 3-O-sulfotransferase 3B1" /gene="HS3ST3B1" /gene_synonym=30ST3B1 /gene_synonym=3OST3B1 /codon_start=1 /product="heparan sulfate D-glucosaminyl 3-O-sulfotransferase 3B1" /protein_id="AAH69725.1" /db_xref="GI:46854904" /db_xref="GeneID:9953" /db_xref="MIM:604058" /translation="MGQRLSGGRSCLDVPGRLLPQPPPPPPPVRRKLALLFAMLCVWL YMFLYSCAGSCAAAPGLLLLGSGSRAAHDPPALATAPDGTPPRLPFRAPPATPLASGK EMAEGAASPEEQSPEVPDSPSPISSFFSGSGSKQLPQAIIIGVKKGGTRALLEFLRVH PDVRAVGAEPHFFDRSYDKGLAWYRDLMPRTLDGQITMEKTPSYFVTREAPARISAMS KDTKLIVVVRDPVTRAISDYTQTLSKRPDIPTFESLTFKNRTAGLIDTSWSAIQIGIY AKHLEHWLRHFPIRQMLFVSGERLISDPAGELGRVQDFLGLKRIITDKHFYFNKTKGF PCLKKAEGSSRPHCLGKTKGRTHPEIDREVVRRLREFYRPFNLKFYQMTGHDFGWD" misc_difference 1165..1165 /gene="HS3ST3B1" /gene_synonym=30ST3B1 /gene_synonym=3OST3B1 /note="'C' in cDNA is 'T' in the human genome; no amino acid change. The chimpanzee genome agrees with the cDNA sequence, suggesting that this difference is unlikely to be due to an artifact." misc_feature 692..1702 /note="HS3ST3B1 coding region" ORIGIN 1 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 61 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 121 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 181 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATA CGCGTACCGC 241 TAGCCAGGAA GAGTTTGTAG AAACGCAAAA AGGCCATCCG TCAGGATGGC CTTCTGCTTA 301 GTTTGATGCC TGGCAGTTTA TGGCGGGCGT CCTGCCCGCC ACCCTCCGGG CCGTTGCTTC 361 ACAACGTTCA AATCCGCTCC CGGCGGATTT GTCCTACTCA GGAGAGCGTT CACCGACAAA 421 CAACAGATAA AACGAAAGGC CCAGTCTTCC GACTGAGCCT TTCGTTTTAT TTGATGCCTG 481 GCAGTTCCCT ACTCTCGCGT TAACGCTAGC ATGGATGTTT TCCCAGTCAC GACGTTGTAA 541 AACGACGGCC AGTCTTAAGC TCGGGCCCCA AATAATGATT TTATTTTGAC TGATAGTGAC 601 CTGTTCGTTG CAACACATTG ATGAGCAATG CTTTTTTATA ATGCCAACTT TGTACAAAAA 661 AGCAGGCTct gaaaacttgt actttcaagg ctcctgcgcc gccgcgccgg ggctgctgct 721 cctgggctct gggtcccgcg ccgcacacga cccgccagcc ctggccacag ctccggacgg 781 gacgcccccc aggctgccgt tccgggcgcc gccagccacc ccactggctt caggcaagga 841 gatggccgag ggcgctgcga gcccggagga gcagagtccc gaggtgccgg actccccaag 901 ccccatctcc agctttttca gtgggtctgg gagcaagcag ctgccgcagg ccatcatcat 961 cggcgtgaag aagggcggca cgcgggcgct gctggagttt ctgcgcgtgc accccgacgt 1021 gcgcgccgtg ggcgccgagc cccatttctt cgatcgcagc tacgacaagg gcctcgcttg 1081 gtaccgggac ctgatgccca gaaccctgga cgggcagatc accatggaga agacgcccag 1141 ttacttcgtc acgcgggagg cccccgcgcg catctcggcc atgtccaagg acaccaagct 1201 catcgtggtg gtgcgggacc cggtgaccag ggccatctcg gactacacgc agacgctgtc 1261 caagcggccc gacatcccca ccttcgagag cttgacgttc aaaaacagga cagcgggcct 1321 catcgacacg tcgtggagcg ccatccagat cggcatctac gccaagcacc tggagcactg 1381 gctgcgccac ttccccatcc gccagatgct cttcgtgagc ggcgagcggc tcatcagcga 1441 cccggccggg gagctgggcc gcgtgcaaga cttcctgggc ctcaagagga tcatcacgga 1501 caagcacttc tacttcaaca agaccaaggg cttcccctgc ctgaagaagg cggagggcag 1561 cagccggccc cattgcctgg gcaagaccaa gggcaggacc catcctgaga tcgaccgcga 1621 ggtggtgcgc aggctgcgcg agttctaccg gcctttcaac ctcaagttct accagatgac 1681 cgggcacgac tttggctggg atTAGGACCC AGCTTTCTTG TACAAAGTTG GCATTATAAG 1741 AAAGCATTGC TTATCAATTT GTTGCAACGA ACAGGTCACT ATCAGTCAAA ATAAAATCAT 1801 TATTTGCCAT CCAGCTGATA TCCCCTATAG TGAGTCGTAT TACATGGTCA TAGCTGTTTC 1861 CTGGCAGCTC TGGCCCGTGT CTCAAAATCT CTGATGTTAC ATTGCACAAG ATAAAATAAT 1921 ATCATCATGA ACAATAAAAC TGTCTGCTTA CATAAACAGT AATACAAGGG GTGTTATGAG 1981 CCATATTCAA CGGGAAACGT CGAGGCCGCG ATTAAATTCC AACATGGATG CTGATTTATA 2041 TGGGTATAAA TGGGCTCGCG ATAATGTCGG GCAATCAGGT GCGACAATCT ATCGCTTGTA 2101 TGGGAAGCCC GATGCGCCAG AGTTGTTTCT GAAACATGGC AAAGGTAGCG TTGCCAATGA 2161 TGTTACAGAT GAGATGGTCA GACTAAACTG GCTGACGGAA TTTATGCCTC TTCCGACCAT 2221 CAAGCATTTT ATCCGTACTC CTGATGATGC ATGGTTACTC ACCACTGCGA TCCCCGGAAA 2281 AACAGCATTC CAGGTATTAG AAGAATATCC TGATTCAGGT GAAAATATTG TTGATGCGCT 2341 GGCAGTGTTC CTGCGCCGGT TGCATTCGAT TCCTGTTTGT AATTGTCCTT TTAACAGCGA 2401 TCGCGTATTT CGTCTCGCTC AGGCGCAATC ACGAATGAAT AACGGTTTGG TTGATGCGAG 2461 TGATTTTGAT GACGAGCGTA ATGGCTGGCC TGTTGAACAA GTCTGGAAAG AAATGCATAA 2521 ACTTTTGCCA TTCTCACCGG ATTCAGTCGT CACTCATGGT GATTTCTCAC TTGATAACCT 2581 TATTTTTGAC GAGGGGAAAT TAATAGGTTG TATTGATGTT GGACGAGTCG GAATCGCAGA 2641 CCGATACCAG GATCTTGCCA TCCTATGGAA CTGCCTCGGT GAGTTTTCTC CTTCATTACA 2701 GAAACGGCTT TTTCAAAAAT ATGGTATTGA TAATCCTGAT ATGAATAAAT TGCAGTTTCA 2761 TTTGATGCTC GATGAGTTTT TCTAATCAGA ATTGGTTAAT TGGTTGTAAC ACTGGCAGAG 2821 CATTACGCTG ACTTGACGGG ACGGCGCAAG CTCATGACCA AAATCCCTTA ACGTGAGTTA 2881 CGCGTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC 2941 TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 3001 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC 3061 GCAGATACCA AATACTGTTC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC 3121 TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG 3181 CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG 3241 GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA 3301 ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC 3361 GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG 3421 GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG 3481 ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT 3541 TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TT //